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Description 
Field of Invention 

This is an invention in the area of pattern recognition. It is a general method 
for matching patterns held in computer memory, where a pattern is represented 
by a set of spatially or topologically arranged nodes each with an associated 
measurement vector. 

Description of Current Techniques 

. There are a multitude of matching techniques. These may be split into two 
broad categories: gradient-based methods and exhaustive search. Examples of 
the former include gradient descent, simulated annealing, relaxation Labeling, 
neural networks and genetic algorithms. All of these work by taking just a few 
initial best guess match solutions and refining them in order to obtain better 
solutions. The second category is exhaustive search. Here, a large number 
of match solutions are examined by sampling the solution space, and the best 
chosen. The forerunner of exhaustive search techniques is the fast access method 
called geometric hashing. 

Problems with Current Methods 

There are problems associated with both of the above categories. In short they 
are slow and give poor performance on non-trivial matching problems. There are 
reasons for this poor performance. Gradient-based methods depend critically on 
obtaining a good initial match, but this is obviously not possible in general since 
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having a good match is the aim in the first place. On the other hand exhaustive 
search methods are dependent on the resolution with which the solution space is 
searched. For matching the space is exponential in the number of nodes, making 
it very difficult to find a good solution in a reasonable time. 

Overview of New Solution 

We propose a new approach to matching which is fast and gives good perfor- 
mance. The approach stems from a new philosophy to pattern recognition based 
upon four key conditions: 

• Condition 1 

Matching is formulated as one of finding the best set of transformations 
between the nodes in two patterns. 

• Condition 2 

Calculations are underpinned by Bayesian probability theory. 

• Condition 3 

The method is holistic in that it requires that all possible solutions must 
be examined. 

• Condition 4 

Processing is resource-driven such that the calculations that can be per- 
formed are constrained by the memory available and the speed of opera- 
tions required, as defined by the operator. 

Conditions 3 and 4 lead to a conundrum: how to look at an exponential 
number of solutions quickly and efficiently. This is achieved by collecting solu- 
tions together into a small number of groups, and assessing each group in turn. 
There are a number of estimates that may be made on a group, but a strategy 
that is effective and consistent with condition 4 since we can trade-off speed for 
accuracy is to obtain upper and lower bound scores (probabilities) for any so- 
lution contained therein. Given these bounds the strategy to take is: eliminate 
groups of solutions if their upper bound falls below the highest lower bound. 
This guarantees that the optimal solution will be retained. By repeating this 
operation we can hone in on interesting regions of the solution space by ex- 
cluding sub-optimal solutions. The remaining solutions may be re-examined in 
increasing detail as processing proceeds and as condition 4 allows. The process 
terminates when all upper bounds exceed the lower bound threshold. At this 
point the lower bound may be heuristically increased to re-start the elimination 
process, or alternatively the remaining transformations may be recorded and 
processed in some way. 
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Detailed Description of New Solution 

Consider a pattern labelled by a set of TV nodes. The nodes have an associated 
set of measurement vectors, z = {x\ , . . . , xjv}. 

Suppose that the set of transformations for the nodes is denoted by w = 
{tui, . . . ,wn], say. From condition 1 the aim is to find the best global solution, 
i.e., the best set of transformations from the nodes in this pattern to a second 
pattern, where, from conditions 2 and 3 we adopt an holistic, probability theory 
approach, requiring: 

w = arg max P(u> = w I x) (1) 

where W is the space of possible solutions. We do not realise this aim by 
directly, i.e., by actively searching for and refining solutions within W , this 
being the approach of existing gradient-based or exhaustive search techniques. 
Rather, we do so indirectly, by eliminating bad solutions from W. In doing so 
we implicitly examine all of the solution space, as required by condition 3, as 
follows. 

We begin by grouping solutions together since examining each individual 
solution in isolation would be computationally intractible in general, thereby 
breaking condition 4. Consider all solutions that contain the individual trans- 
formation u)i = q, say. The lowest upper bound on any one of these solutions 
is such that: 

U(tVi = a) = max P(tUj = a, w | x) (2) 

where w denotes the solutions on all nodes excluding that under considera- 
tion, and W is the space of possible solutions for this set. 

Now any group of solutions whose upper bound probability is below some 
known lower bound value, L, say, of interest cannot contain the optimum solu- 
tion. Therefore, we can eliminate these groups from consideration. Therefore 
the rule at some iteration time n is: 

eliminate any solution containing the transformation W{ = a if 

U^ n Hwi = a) < Lf"> (3) 

This is the basis of the method: an upper bound on the probability of a group 
of solutions can be computed and compared against a lower bound threshold. 
If the upper bound falls below the threshold the group can be eliminated. 

The computation of the upper bound has not yet been defined, and in general 
may be compuationally expensive, thereby breaking condition 4. The solution 
is to identify quantities of the form (wi = a) such that (wi = a) > 
U^(wi — a) which can be computed in a given time. The elimination rule 
then becomes: 

eliminate any solution containing the transformation Wi = a if 

f/< n > (Wi = a) < (wi =<*) < (4) 
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is evaluated by combining Bayesian probability theory with rules of 
inequality. Its form may change over the iterative cycles in order to accommo- 
date condition 4. For example, at the onset of processing G^ may be coarsely 
and quickly evaluated, but provided it obeys G^ > then only bad trans- 
formations will be eliminated. Towards the end of processing when only a few 
solutions remain, a more sophisticated and computatioanlly intensive means of 
computing G may be employed, such that G^ ~ U^ n \ provided condition 4 is 
not violated. 

Processing will continue until no solutions fall below the threshold. At any 
time processing may be re-started by heuristically increasing the threshold, or 
alternatively, the remaining transformations may be recorded and processed in 
some manner. 

An Example: Matching in Chemical Databases 

An example use of the method is retrieval of bio-active compounds from chemi- 
cal databases by using one or more query or lead compounds a cue. The starting 
point is to represent query and database compounds as patterns, each identi- 
fied by a set of spatially or topologically arranged nodes, each node having an 
associated measurement vector. 

We can develop the upper bound quantities. By applying Bayes' rule (2) 
becomes 

U (wi = a) = maxp(x \wi~ a, w)P(wi = a,w)/p(x) (5) 

Making the non-restrictive assumption that the measurement vectors are 
independent when conditioned on the transformations then this becomes 

U(wi =a)= p(xi | Wi = a)P(wi = a) max{TTp(a: i | Wj)}P{w | w { = a)/p(x) 

*** jA 

(6) 

Now introduce an inequality to reduce computational complexity. An option 
is maxoeA&eB-Pfofc) < mox o€> iP(o) max^eB which gives 

U(wi = a) < 

p{xi | Wi = a)P(wi = a){Yljjti max^ewi pfrj \ wj = 0) 

P(wj =/3\wi= <x)}/ P (x) (7) 

where Wj is the set of possible transformations for node j, and which reduces 
the complexity of the upper bound calculation from exponential to 0(N*). Al- 
ternative inequalities could be applied here leading to increases or decreases in 
complexity, as required. 

Now a current possibility can be eliminated as sub-optimal if its upper bound 
as computed in (5) falls below some determined lower bound threshold. For 
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example, if we have identified a solution and its probability at some time we can 
use this to set the lower bound. This gives a matching-by-exclusion algorithm 
in which regions of the solution space are iteratively pruned away. As a possible 
transformation on a node is eliminated at one time, so this affects the support 
computed for possibilities on other nodes at the next iteration. 

The algorithm can be applied to all candidate transformations at all nodes, 
synchronously or asynchronously, and can be expressed as: 

eliminate the transformation w i = a from the list W r / n+1 * if 

p( Xi | Wi = a)P(wi = a){JJ max p( Xj \ w j = 0)P(wj = 0\wi = a)}< A^ n) 

(8) 

where Aj n * is the threshold value, and n is the time index. 

Suppose that we take logarithms. The elimination rule then becomes 

eliminate the transformation Wi = a from the list w£ n+1 ^ if 

S<">( Wi = a)<IogA< n) (9) 

where S^ (wi = a) counts the number of nodes that may be consistent with 
the assignment at node i: 

S( n Ht»i = a) = 

log(p(ari | wi — a)P(wi — a)) + 
max^^, log(p(xj | wj = 0)P(wj = /J | tu 4 = a)) , (10) 

Application of the method in requires models for the distributions and priors 
in (10). For the application of compound matching one alternative is rectilinear 
distributions with zero height away from their centre. In this case the support 
for an individual transformation is: 

S^Hwi = a) = kY* max h(w* = a t wj = 0) (11) 

for n > 0, where k is a constant and where all solutions not compatible 
with the data have been eliminated at the onset. Here h(wi = o, wj = 0) is a 
binary comptibility measure, simply stating if the transformation a on node i 
is compatible with the solution 0 on node j at time n. 

The procedure can combine the algorithm in (9) with geometric hashing Q. 
It involves a storage stage in which database compounds are encoded in a hash 
table, and a recall stage in which a query compound is used to access the table, 
and plausible transformations are examined. Finally, a clustering stage may be 
added to refine remaining solutions. 
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Storage 

The following steps are taken in storage for each database compound: 

• Generate the database compound nodes, and their measurement vectors 
to include node position and normal 

• Generate a frame for each point using the centroid-position- normal triplet 

• Align this frame to the world frame and store the compound in a hash 
table as compound-node-transformation triplets 

Recall 

The following steps are taken in recall: 

• Generate the query compound to define the object nodes, their positions 
and normals 

• Generate a frame for each node using the centroid-position-normal triplet 

• Align this frame to the world frame and access the hash table, assigning 
accessed transformations to each node 

• Convert the transformation matrices to rotation parameters and store in 
a hash table 

• Use the matching- by-elimination procedure in (9) to eliminate implausible 
rotation solutions 

• Cluster the remaining solutions and obtain a similarity index score for 
each by overlaying compounds 

Modifications 

Modifications to description above occur at the level of modeling. This may 
either be alterations to the form of the distributions assumed or to the mea- 
surement features employed. For example, in the molecule matching we have 
used rectilinear distributions but in this and other examples Gaussian distri- 
butions, say, may be appropriate and, for example, curvature information may 
have been employed. 

Under different models and using different measurements there are a number 
of application areas for the technique: 

• Medical image analysis 

• Visual inspection and control 

• DNA and protein sequence matching 

• Financial prediction 
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Claims 

• 1. A computer system of one or more processors for matching patterns by 
an exam ini nation of all possible transformations between patterns, com- 
prising: 

— a database of one or more patterns where each pattern is identified 
by a set of spatially or topologically arranged nodes, where each node 
has an associated measurement vector, the database being stored in 
one or more memories that are accessible to the processors; 

— a query pattern identified by a set of spatially or topologically ar- 
ranged nodes, where each node has an associated measurement vector 
and an associated memory to store a set of possible transformations; 

— a method A for initialising the set of possible transformations on each 
node in the query pattern; 

— an iterative process for eliminating possible transformations, where 
each iteration comprises, 

a method B for computing an upper bound probability, U t on all 
solutions that contain a particular transformation in the set at a 
particular node; 

a method C for computing a lower bound probability, L; 
a thresholder for eliminating from the sets at each node transforma- 
tions for which U < L. 

a halting procedure for stopping the iterative process. 

— a recorder for recording the transformations that remain. 
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• 2. A system, as in claim 1, where method B and method C may be designed 
or altered at any time so as to either increase or decrease the the time taken 
for processing to terminate, as required by the system operator, including 
the possibility of designing or altering method B and method C to be of 
constant complexity with the" effect that the complexity of processing is 
then 0{N^), where N is the number of nodes in the database patterns. 

• 3. A system, as in claim 2, where further, method A, method B and 
method C are stipulated using Bayesian probability theory. 

• 4. A system, as in claim 3, capable of accessing one or more objects from 
a database using a query of one or more objects, by a process of: 

— representing each object in the database as a pattern where each 
pattern is identified by a set of spatially or topologically arranged 
nodes, where each node has an associated measurement vector, the 
database being stored in one or more memories that are accessible to 
the processors; 

— selecting a query object from the query set; 

— representing the query object as a query pattern identified by a set 
of spatially or topologically arranged nodes, where each node has an 
associated measurement vector and an associated memory to store a 
set of possible transformations; 

— identifying the best transformations of the nodes in the query pat- 
tern as in claim 3, and optionally grouping these transformations by 
some averaging or clustering method, and subsequently assessing the 
probability or some similarity score of these remaining transforma- 
tions; 

— recording those set of transformations and similarity scores. 

• 5. A system, as in claim 4, capable of discovering novel bio-active com- 
pounds from a database using a query set of one or more lead compounds 
by representing database and query compounds as patterns, and for which 
method B and method C can be defined to be of variable complexity, in- 
cluding constant complexity for each node, leading to a complexity for 
processing of 0(N^) for each compound-compound comparison, and pro- 
viding for increases in the speed of analysis in excess of 2-3 orders of 
magnitude compared with existing techniques for discovering bio-active 
compounds. 

• 6. A system, as in claim 4, capable of the identification of multi-dimensional 
objects at any degree of translation or rotation in image data, including 
the identification of advertising brands or logos in television or video data, 
for the purpose of registration or control, by representing the objects as 
patterns, and for which method B and method C can be defined to be of 
variable complexity, including constant complexity for each node, leading 
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to a complexity for processing of 0(N 2 ) and enabling the identification to 
be achieved in real time. 
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